Whole-Genome Sequencing and Comparative Analysis of Yersinia 
pestis, the Causative Agent of a Plague Outbreak in Northern Peru 
The plague is a zoonotic disease caused by the bacterium Yersinia pestis. Here, we report the complete genome sequence of the 
Y. pestis strain INS, which was isolated from swollen lymph gland aspirate (bubo aspirate) of an infected patient from a pneu- 
monic outbreak in 2010 in northern Peru. 
Three biovars of Yersinia pestis have been characterized: Antiqua, 
Medievalis, and Orientalis (1). The Antiqua and Medievalis biovars 
are ancient and were responsible for the plague of Justinian and the 
Black Death, respectively. Orientalis is responsible for the modern 
pandemic and has been recognized all around the world. The genomes 
of several strains of Yersinia pestis (CO92, KIM, and others) have 
been sequenced and annotated. 

A total of 5 µg of DNA was sheared using a Covaris ultrasonicator, 
and a DNA library was prepared using the Fragment library kit 
according to the manufacturer's instructions (Applied Biosystems). 
The DNA library was sequenced on the SOLiD 3 Plus genome analyzer, 
which yielded 250-fold coverage. Bioscope software (Applied 
Biosystems) was used to correct dibase miscalls prior to assembly. 
Mapping of the reads was achieved using Bowtie 2.0 (2). The Y. pestis 
strain CO92 (NCBI accession no. AL590842.1) genome sequence was used 
as a reference for reassembly, yielding a total of 18 contigs. The 
consensus sequence was obtained with SAMtools (3) and Gap5 (4) for 
further analysis. For the genome annotation, we used Rapid 
Annotations using Subsystems Technology (RAST) (5), and comparative 
analysis was performed with the fully annotated genome of strain 
CO92 (6). We used SNP Finder (7) for detection of single-nucleotide 
polymorphisms (SNPs) and small indels between the reference and 
assembled genomes. 
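
The comparison step can be illustrated with a minimal Python sketch. It is not the algorithm implemented in SNP Finder (7); it only assumes that the reference and the assembled consensus have already been placed in a gapped pairwise alignment of equal length, and the function name find_variants is hypothetical.

```python
def find_variants(ref_aln: str, qry_aln: str):
    """Scan a gapped pairwise alignment and report SNPs and small indels.

    Assumption: both strings are rows of the same alignment (equal length,
    '-' marks a gap). This is an illustration, not SNP Finder's method.

    Returns (snps, indels):
      snps   -> list of (ref_position, ref_base, query_base), 1-based
      indels -> list of (ref_position, kind, length); for 'del' the position
                is the first deleted reference base, for 'ins' it is the
                last reference base before the inserted bases.
    """
    if len(ref_aln) != len(qry_aln):
        raise ValueError("aligned sequences must have equal length")

    snps, indels = [], []
    ref_pos = 0                     # reference bases consumed so far
    run_kind, run_start, run_len = None, 0, 0

    def flush():
        # Close the indel run currently being extended, if any.
        nonlocal run_kind, run_len
        if run_kind is not None:
            indels.append((run_start, run_kind, run_len))
        run_kind, run_len = None, 0

    for r, q in zip(ref_aln.upper(), qry_aln.upper()):
        if r == "-" and q == "-":
            continue                # column carries no information
        kind = "ins" if r == "-" else "del" if q == "-" else None
        if r != "-":
            ref_pos += 1
        if kind is None:
            flush()
            # Only unambiguous bases are compared; N and IUPAC codes are skipped.
            if r != q and r in "ACGT" and q in "ACGT":
                snps.append((ref_pos, r, q))
        else:
            if kind != run_kind:
                flush()
                run_kind, run_start = kind, ref_pos
            run_len += 1
    flush()
    return snps, indels


snps, indels = find_variants("ACGTACGT-ACG", "ACTTA--TGACG")
```

In this toy alignment, snps holds one substitution at reference position 3 and indels holds a 2-base deletion starting at position 6 and a 1-base insertion after position 8.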

Genome-wide comparison of strain INS to several other genomes 
was performed using BLAST Ring Image Generator (BRIG) (8). The 
plasmids pCD1 (NCBI accession no. AL117189.1), pMT1 (NCBI 
accession no. AL117211.1), and pPCP1 (NCBI accession no. 
AL109969.1) were also used as references for the reassembly of the 
reads that had not been mapped. 

Y. pestis INS is 4.6 Mb long and 99.93% identical to Y. pestis CO92. 
It has an average G+C content of 47.6%. Genome annotation found 
4,214 protein-coding sequences, 19 rRNA genes, and 69 tRNA genes. 
Of these, 2,316 were assigned unequivocal functions, mainly in 
virulence factors, like invasins (n = 4), adhesion (n = 6), toxins 
(n = 17), invasion and intracellular resistance (n = 19), heme and 
iron uptake (n = 20), and resistance to antibiotics (n = 29). We 
found 63 SNPs, none of which affected protein-coding sequences of 
the genes that were annotated. However, 43 of these 63 SNPs might 
influence gene expression due to their location within the 5' or 3' 
untranslated regions (UTRs) (9). Additionally, we found three 
pathogenic plasmids: pCD1, pMT1, and pPCP1. The first one carries a 
complete type IV secretory system, which increases its capacity for 
cell invasion and defense neutralization. The second one holds the 
murine toxin (MT), which allows the bacterium to survive at 37°C. 
The last one has a plasminogen activator gene that helps to evade 
the immunological response. Moreover, pCD1 had one deletion of 1 kb 
and 2 SNPs. This deletion was found next to the IS100 element, which 
accounts for significant recombination rates through horizontal gene 
transfer (10). On the other hand, pMT1 had 4 deletions and 2 SNPs. 
It is not yet clear whether these one-gene deletions correspond to 
highly mobile genetic elements. 
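
Two of the summary figures reported here, the G+C content and the share of SNPs that fall in UTRs, are simple ratios. The sketch below shows how such values could be computed; the helper gc_content is hypothetical and is not the tool used in this study, and the SNP counts are taken directly from the text.

```python
def gc_content(seq: str) -> float:
    """Percent G+C among unambiguous bases (A, C, G, T) of a sequence.

    Ambiguity codes such as N are excluded from the denominator, which is
    one common convention; other tools may count them differently.
    """
    seq = seq.upper()
    counted = sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        raise ValueError("sequence contains no unambiguous bases")
    return 100.0 * (seq.count("G") + seq.count("C")) / counted


# Toy sequence, not genome data: 4 of 8 unambiguous bases are G or C.
toy_gc = gc_content("GGCCAATTNN")

# Share of the reported SNPs located in 5' or 3' UTRs (43 of 63, from the text).
utr_snps, total_snps = 43, 63
utr_share = round(100.0 * utr_snps / total_snps, 1)
```

Applied to the assembled consensus, gc_content would be expected to return a value close to the 47.6% reported above, and utr_share shows that roughly two-thirds of the detected SNPs lie in UTRs.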

Y. pestis INS has similarities to the La Paz strain, mainly in the 
presence of a multidrug translocase gene (11) that confers increased 
antibiotic resistance. Our findings confirm the presence of the 
1.ORI group in our country and the expansion of the biovar 
Orientalis (12). 

Nucleotide sequence accession numbers. This Whole Genome 
Shotgun project has been deposited at DDBJ/EMBL/GenBank under 
the accession no. AMQL00000000. The version described in this 
paper is the first version, AMQL01000000. 